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Abstract 

We consider the population dynamics of a set of species whose net- 
work of catalytic interactions is described by a directed graph. The 
relationship between the attractors of this dynamics and the under- 
lying graph theoretic structures like cycles and autocatalytic sets is 
discussed. It is shown that when the population dynamics is suitably 
coupled to a slow dynamics of the graph itself, the network evolves to- 
wards increasing complexity driven by autocatalytic sets. Some quan- 
titative measures of network complexity are described. 
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Complex networks of interacting components are a characteristic feature 
of adaptive systems. Examples include prebiotic chemical evolution which 
produced complex organizations of molecules culminating in a living cell, 
biological evolution which produces complex ecologies with networks of in- 
terdependent species, and economic and social evolution in which webs of 
interacting agents appear spontaneously. It is of interest to understand how 
such webs originate and become more complex. 

In this paper we discuss a model which describes the network in terms 
of a directed graph, and treats it as a dynamical variable. The dynamics 
of the graph is determined by an underlying population dynamics on a fast 
time scale in which certain structures, autocatalytic sets (ACSs) P, play an 
important role. ACSs arise spontaneously in the network and then trigger 
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an increase in its complexity. The main purpose of this paper is to describe 
the mathematical properties of ACSs and explain why they cause an increase 
of complexity of the network in this model. 

Population dynamics on a fixed artificial chemistry 

The system is described by a directed graph with s nodes. Associated with 
each node i is a population yi > which evolves in time according to 

s 

yi = ^Cijyj - (t)yu (1) 
i=i 

where (f) is some function of time and C = (cij), i, j = 1, . . . , s is the adja- 
cency matrix of the graph, i.e., Cij = 1 if there is a directed link from j to i 
and zero otherwise. Links from a node to itself are disallowed; cu = 0. 

can be interpreted as a set of rate equations in an artificial chem- 
istry similar to the models studied in [P@[§||6|. The nodes of the graph 
correspond to molecular species and a directed link from node j to node i 
indicates that species j catalyses the production of species i. Let i be pro- 

duced by a reaction of the type a + 6 — > i, i.e., from the ligation of reactants 
a and b catalysed by j. In the approximation of a well stirred chemical 
reactor with a constant dilution flux (j), the rate of growth of i is given by 
i/i = kf{l + vyj)ah — (pyi, where a and b denote reactant populations, kf is 
the rate constant for the spontaneous reaction, and ly is the catalytic effi- 
ciency . Assuming that the spontaneous reaction is much slower than the 
catalysed reaction, and that the concentrations of the reactants are fixed 
and large, the first term will be proportional to yj: iji = kyj — (pyi, k being 
a constant. Making the further idealization that all catalytic strengths are 
equal the rate equations reduce to (0). 

We are interested in the attractors of the relative population dynamics 
which follows from (|^), namely 

s s 
Xi — ^ ^ CijXj Xi ^ ^ Cj^jXj, (2) 
j=l k,j=l 

where Xi = yi/J2j=iyj- By definition, the relative population vector x = 
(xi, . . . , Xs) belongs to the simplex J = {x G R*|0 < Xi < 1, J2i=i Xi = 1}. 
J is invariant under (|2|) since Cij > 0. Let X = {Xi, . . . , Xg) denote an 
attractive fixed point of (||), = (y^, . . . , y^) (or equivalent column vector) 
a right eigenvector of C with eigenvalue A, and Ai the eigenvalue of C which 
has the largest real part. Then x'^ = /J2j=i is a fixed point of (|^. If 
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Ai is nondegenerate, X = x ^ is the unique asymptotically stable attractor 
of (§) If Ai is degenerate the attractor configuration is still a fixed 
point; now X is a linear superposition of the x^^ that depends on the initial 
condition x(0). The Perron- Frobenius theorem for non-negative matrices 
ensures that Ai is real and > 0. Table 1 shows a few simple graphs and their 
corresponding Ai and attractors. Chains and trees have Ai = 0. Simple 
cycles of any size have Ai = 1 while more complicated structures such as the 
double loop or the eight structure have higher Ai. 

Autocatalytic sets and spectral properties of directed graphs 

An autocatalytic set (ACS) is a subset of the species that contains the 
catalysts for all its members [Q]. In the present context, one can define an 
ACS as a subgraph, each of whose nodes has at least one incoming link from 
a node belonging to the same subgraph. The simplest ACS is a 2-cycle. 
Every cycle is an ACS but the converse is not true. 

(i) If a graph has no cycle then Ai = 0. If C is the adjacency matrix of a 
graph then {C^)ij counts the number of distinct paths of length n from node 
j to node i. If Aj are the eigenvalues of C then A" are the eigenvalues of C". 
If a graph has no cycle then let the length of the longest path between any 
two nodes of the graph be denoted r. Clearly = for m > r. Therefore 
all eigenvalues of C™" are zero. Hence, all eigenvalues of C are zero which 
implies Ai = 0. 

(ii) If a graph has a cycle then Ai > 1. If a graph has a cycle then there is 
some vertex i which has at least one path to itself of length n, i.e. {C^)ii > 1, 
for infinitely many values of n. Since the sum of the diagonal entries of a 
matrix equals the sum of the eigenvalues of the matrix, J2i=i{C'^)ii is equal 
to the sum of the n^^ powers of the eigenvalues of C. Thus the sum of the 
n''^ powers of the eigenvalues is at least 1, for infinitely many values of n. 
Therefore, there exists an eigenvalue with modulus > 1. By the Perron- 
Frobenius theorem, Ai is the eigenvalue with the largest modulus, hence 
Ai > 1 §. 

An ACS is not in general an irreducible subgraph, because there need 
not be a path from every node of an ACS to another. Thus the Perron- 
Frobenius theorem for irreducible non-negative matrices Q does not apply 
to ACSs. Nevertheless, ACSs share some important properties with cycles 
(which are irreducible): 

(iii) An ACS must contain a cycle. Let C be the adjacency matrix of a 
graph which is itself an ACS. Then by definition, every row of C has at least 
one non-zero entry. Construct C by removing all non-zero entries in each 
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row of C except one which can be chosen arbitrarily. Thus C has exactly 
one non-zero entry in each row. Clearly the column vector x = (1, 1, . . . , 1) 
is a right eigenvector of C with eigenvalue 1. This implies that the graph for 
which C is the adjacency matrix contains a cycle. Since the construction of 
C from C involved only removal of some links, it follows that the original 
graph must also contain a cycle. 

(iv) // a graph has no ACS then Ai = 0. This follows from (i) and (iii). 

(v) // a graph has an ACS then Ai > 1. This follows from (ii) and (iii). 
The reason why an ACS is a useful concept in the present context is the 

following property, not true of cycles in general: 

(vi) // a graph has Ai > 1, then the subgraph corresponding to the set of 
nodes i for which xf^ > is an A CS. Renumber the nodes of the graph so 
that xf^ > only for i = 1, . . . ,k. Let C be the adjacency matrix of this 
graph. Since x'*'^ is an eigenvector of the matrix C we have X]j=i '^ij^j^ — 
X^xf^ =^ ^^^^-^ djXj^ = Xixf'. Since xf' > only for i = 1, . . . , /c it follows 
that for each i £ {1, . . . ,k} there exists a j such that Cij > 0. Hence the 
k X k submatrix C = (cij), i,j = 1, . . . ,k has at least one non-zero entry in 
each row. Thus each node of the subgraph corresponding to this submatrix 
has an incoming link from one of the other nodes in the subgraph. Hence 
the subgraph is an ACS. We call this subgraph the 'dominant ACS' of the 
graph. 

(vii) Consider a graph with no cycles and let there be a chain of r links 
in this graph whose successive nodes are labelled i = 1, 2, . . . , r -|- 1. The 
node 1 (to which there is no incoming link) has a constant population yi 
since the r.h.s of (Q) vanishes for i = 1 when cf) = 0. For the node 2, we get 
2/2 = yi, hence 2/2 (i) = ?/2(0) -|- yit ~ t for large t. Similarly it can be seen 
that grows as i'^"^. In general, it is clear that for a graph with no cycles, 
yi ~ for large t when (j) = 0, where r is the length of the longest path 
terminating at node i. Since the dynamics (^) does not depend upon the 
choice of (p, this proves that for a graph with no cycles Xj = for all i except 
the nodes at which the longest paths in the graph terminate. Similarly if a 
2-cycle feeds into another 2-cycle as shown in Table 1, Xi = for the nodes 
in the first 2-cycle. These are examples of the more general situation where 
Xi = for nodes of a subgraph which has another subgraph 'downstream' 
from it with an equal or larger Ai. 

Evolution of the network 

On a short timescale the graph remains fixed and the Xi evolve according 
to (^). On a longer timescale at discretely spaced intervals (labelled by 
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n = 1, 2, . . .) the graph itself changes by the ehmination of an existing species 
and the creation of a new one. At the initial time (n = 0) the graph is 
random: Cij = 1 (for i ^ j) with probability p {p is called the 'catalytic 
probability') and zero with probability 1 — p. ra = p{s — 1) is referred to 
as the average connectivity. The graph at n + 1 is obtained from that at 
n by the following procedure: Determine the 'set of least fit nodes' at n, 
namely, those that have the smallest Xi for the graph at n. Pick a node 
at random from this set, and reassign links between it and all other nodes 
randomly with the same probability p. The idea is that for a fixed graph the 
relative population stabilizes at X. One of the least fit species {Xi is taken 
to be a measure of fitness) mutates or is eliminated. This corresponds to 
an 'extremal dynamics' as in the model of Bak and Sneppen . The graph 
update corresponds to the appearance of the mutant or replacement, which 
has random connections (but with the same average connectivity m as in 
the initial graph) with other species. 

Define si{n) as the number of species i for which Xj 7^ at the n^^ time 
step. For a graph with Ai > 1, si is the number of nodes in the dominant 
ACS. Whenever si < s it can be shown that the set of least fit nodes, which 
is the set of nodes with Xi = 0, is unique and independent of the initial 
condition even if Ai is degenerate. When si = s, Ai has turned out to be 
nondegenerate in the runs displayed. Hence there is no ambiguity in the 
update procedure arising from initial conditions on x. 

Figures 1 and 2 show how Ai and si evolve for a run with s = 100 and 
m = 0.25. For n < ni = 1515, Ai = and there is no ACS in the graph. 
For n < ni, si is below 10 most of the time. At n = ni the graph update 
forms a 3-cycle which becomes the dominant ACS and Ai jumps from zero 
to one. The nodes outside the dominant ACS by definition have xf^ = 
and constitute the set of least fit nodes. Therefore, as long as the dominant 
ACS at step n (let us denote this subgraph as A(n)) does not include the 
whole graph, i.e., si < s, the mutating node will be outside it. At step n + 1 
the mutant species can either (a) get linked to A(n), or (b) form another 
ACS with other nodes which were not part of A(n), or (c) be a singleton 
or part of a non-ACS structure. In all cases Ai(n + 1) cannot be less than 
Ai(n). (For case (a) this depends upon the fact that the mutating node, 
being outside A(n), cannot destroy any of its links.) Thus, whenever si < s, 
Ai is a non- decreasing function of n. It follows that once an ACS is formed 
by chance, the autocatalytic property of the graph will be preserved until 
the dominant ACS engulfs the whole graph. 

After the appearance of the ACS at n = rii, whenever a node gets an 
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incoming link from the dominant ACS it becomes part of the dominant 
ACS. Most of the time this increases si and the dominant ACS grows until 
it spans the entire graph at n = n2 = 3010 when si = s for the first time. 
For n G [ni,n2], si locally averaged in time grows exponentially. 

In the given run a 2-cycle disconnected from the existing dominant ACS 
is created n = 1807. Both the cycles coexist and grow together until n = 
2145 when the graph update creates a chain from the 3-cycle to the 2-cycle. 
Thus a situation occured in which a subgraph with Ai = 1, comprising the 
3-cycle and the nodes being fed by the 3-cycle, was now feeding into the 
2-cycle 'downstream' which also had Ai = 1. Hence all the nodes of the first 
subgraph entered the set of least fit nodes (see remarks under (vii)) and si 
decreased from 15 to 10. Later at n = 2240 the chain joining the cycles was 
broken by the graph update and both cycles became part of the dominant 
ACS. At n = 2607, si decreases from 58 to 31. This time also a chain formed 
between the cycles with the 2-cycle now feeding into the 3-cycle. si rises 
back up to 54 at n = 2611 when this chain too was broken by the graph 
update. There is one more fall in si at n = 2780. The graph update results 
in the 3-cycle being converted to a double loop which has Ai ~ 1.17 which 
overshadows the ACS whose core is the 2-cycle. 

Even in regions where si is increasing the ACS structure could be chang- 
ing quite a lot. Many complicated processes are possible, such as formation 
of new disconnected ACSs or the reinforcement of an old overshadowed ACS 
before it was completely broken up. Thus the actual growth of the ACS is 
usually a very complicated process with the dominant ACS often undergo- 
ing drastic changes in structure caused by purely chance events. We wish to 
emphasize, however, that notwithstanding the historical particularities of a 
given run, every history respects the rule that Ai is a nondecreasing function 
of time (unless si = s) in this simple model. (In a 'non-extremal' dynamics, 
one can expect a stochastic version of this monotonicity, as long as selec- 
tion is still sufficiently strong.) Further, the ensemble of runs with the same 
parameter values m, s has identifiable characteristics like the average time 
of appearance of an ACS = (ni) ~ s/m^ ~ l/(p^s), and the exponential 
growth time scale ~ s/m 1/p (for sufficiently small m and large s) [Q]. 

At n = 71-2 the whole graph becomes an ACS and for the first time 
the graph update will alter a node from the dominant ACS. Then Ai can 
decrease. It eventually settles in a statistical steady state with large fluc- 
tuations in the run shown. For very low m, Ai can even become zero in a 
graph update. 

In addition to Ai, si and the total number of links /, another measure 
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of the complexity of the graph is 'interdependency', denoted d, which we 
define as follows: d = J2i=i di, where di is the 'dependency' of the i^^ 
node, di is the total number of links in all paths that terminate at node 
i, each link counted only once. Since di counts how many links ultimately 
'feed into' the node i, it is a measure of how 'dependent' species i is on other 
species. Thus d is a measure of how interdependent the species in the graph 
are. From Table 1 it is evident that graphs with the same number of links 
can have different d. 

Figure 3 shows d versus n for the same run (the lower curve in the 
figure corresponds to a 'random run' in which the node to be eliminated is 
picked randomly from the entire set of s nodes). The qualitative features are 
similar to I which has been discussed in . A quantitative difference is that 
from n = to the steady state, d increases by a factor of approximately 
50 which is much more than the increase in I (a factor of about 5, see 
[0] ) . The evolution of the system does not seem to lead to maximally dense 
graphs, m* ~ s, but only m* ~ 0(1) (m* is the steady state connectivity). 
Nevertheless the system achieves a high interdependency. A steady state d 
value of about 25 means that each species is being fed with an appreciable 
fraction of the total number of links, suggestive of long range correlations 
having developed in the system. The fluctuations of d in the steady state 
are also more pronounced than for I. At each graph update the change in 
the number of links is 0(1) for fixed m but the addition or removal of a few 
links causes large changes in d, reflecting its nonlocal character. 
Acknowledgement: S. J. acknowledges the affiliation and support of the 
Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore, as 
well as Associate membership and hospitality of the Abdus Salam Interna- 
tional Centre for Theoretical Physics, Trieste. 

References 

[1] S. Jain and S. Krishna, Autocatalytic sets and the growth of com- 
plexity in an evolutionary model, preprint no. IISc-CTS/9/98, \adap- 
I org/980900^ 

[2] S. A. Kauffman, J. Theor. Biol. 119, 1 (1986); The Origins of Order 
(Oxford University Press, Oxford, 1993). 

[3] J. D. Farmer, S. A. Kauffman and N. H. Packard, Physica D22, 50 
(1986); R. J. Bagley and J. D. Farmer, in Artificial Life II, edited by 



7 



C.G. Langton et al., (Addison Wesley, Redwood City, 1992), p. 93; R. 
J. Bagley, J. D. Farmer and W. Fontana, in Artificial Life II, p. 141. 

[4] W. Fontana, in Artificial Life II, p. 159; W. Fontana and L. W. Buss, 
Bull. Math. Biol. 56, 1 (1994). 

[5] R F. Stadler, W. Fontana and J. H. Miller, Physica D63, 378 (1993). 

[6] D. Segre, D. Lancet, O. Kcdem, and Y. Pilpel, Origins of Life and 
Evolution of the Biosphere 28, 501 (1998); D. Segre and D. Lancet, in 
Chemical Evolution: Exobiology, An Abdus Salam Memorial, edited by 
J. Chela-Flores and F. Raulin (Kluwer, Dordrecht, The Netherlands, 
1998) p. 123. 

[7] The above proofs of statements (i) and (ii) were supplied to us by V. 
S. Borkar and R. Hariharan, respectively. 

[8] E. Seneta, Non-Negative Matrices (George Allen and Unwin, London, 
1973). 

[9] P. Bak and K. Sneppen, Phys. Rev. Lett. 71, 4083 (1993). 



1.5 



1 



0.5 



0, 




1000 2000 3000 4000 5000 6000 7000 
n 

Figure 1: Ai versus n for s = 100, m = 0.25. 
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Figure 2: si versus n for s = 100, m = 0.25. 
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Figure 3: Interdependency versus n for s = 100, m = 0.25. The lower curve 
is for a random run with s = 100, m = 0.25. 
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Table 1: Adjacency matrices, largest eigenvalues , the corresponding eigen- 
vectors and interdependency of some simple graphs 
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